Exploring Uncertainty Measures for Image-caption Embedding-and-retrieval Task

نویسندگان

چکیده

With the significant development of black-box machine learning algorithms, particularly deep neural networks, practical demand for reliability assessment is rapidly increasing. On basis concept that “Bayesian knows what it does not know,” uncertainty network outputs has been investigated as a measure classification and regression tasks. By considering an embedding task task, several existing studies have quantified embedded features improved retrieval performance cutting-edge models by model averaging. However, in image-caption embedding-and-retrieval tasks, well-known samples are always easy to retrieve. This study shows method poor investigates another aspect We propose posterior which can accurately assess results. The consistent two measures observed with different datasets (MS-COCO Flickr30k), deep-learning architectures (dropout batch normalization), similarity functions. To best our knowledge, this first perform on

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

evaluation of dissimilarity measures for image retrieval and classification

in this paper, the performance of 11 different distances for image retrieval and classification, based on color, shape and texture, is evaluated. the precision-recall measure and the correct classification rate of the k-nn classifier are used to evaluate retrieval and classification performances, respectively. the experimental results for a database of 1000 images from 10 different semantic gro...

متن کامل

Caption vs. Query Translation for Cross-Language Image Retrieval

For many cross-language retrieval tasks, the predominant approach is to translate the query into the language of the document collection (target language). This often gives results as good as, if not better, than translating the document collection into the query language (source language). In this paper, we evaluate query versus document translation for the ImageCLEF 2004 bilingual ad hoc retr...

متن کامل

Shape measures for image retrieval

One of the main goals in Content Based Image Retrieval (CBIR) is to incorporate shape into the process in a reliable manner. In order to overcome the difficulties of directly obtaining shape information (in particular avoiding region segmentation) we develop several shape measures that tackle the problem in an indirect manner, requiring only a minimal amount of segmentation. A histogram-based s...

متن کامل

ISIA at the ImageCLEF 2017 Image Caption Task

This paper describes the details of our methods for participation in the caption prediction task of ImageCLEF 2017. The dataset we use is all provided by the organizers and doesn’t include any external resources. The key components of our framework include a deep model part, an SVM part and a caption retrieval part. In deep model part, we use an end to end architecture with Convolutional neural...

متن کامل

» research note « evaluation of dissimilarity measures for image retrieval and classification

in this paper, the performance of 11 different distances for image retrieval and classification, based on color, shape and texture, is evaluated. the precision-recall measure and the correct classification rate of the k-nn classifier are used to evaluate retrieval and classification performances, respectively. the experimental results for a database of 1000 images from 10 different semantic gro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Multimedia Computing, Communications, and Applications

سال: 2021

ISSN: ['1551-6857', '1551-6865']

DOI: https://doi.org/10.1145/3425663